The Multimodal Listening Test in a High-Stakes Context: Gender-Neutral or not?

نویسندگان

چکیده

In this study, we used the Rasch measurement to investigate fairness of listening section a national computerized high-stakes English test for differential item functioning (DIF) across gender subgroups. The format inspired us whether items measure comprehension differently females and males. Exploring novel task types including multimodal materials such as videos pictures was especially interesting. Firstly, unidimensionality local independence data were examined preconditions DIF analysis. Secondly, authors explored performance female male students through analysis using measurement. uniform showed that 25 (out 30 items) displayed favored different subgroups, whereas effect size not meaningful. non-uniform revealed several exhibiting with moderate large size, favoring various ability groups. Explanations are hypothesized. Finally, implications study regarding development discussed.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Differential Item Functioning (DIF) in Terms of Gender in the Reading Comprehension Subtest of a High-Stakes Test

Validation is an important enterprise especially when a test is a high stakes one. Demographic variables like gender and field of study can affect test results and interpretations. Differential Item Functioning (DIF) is a way to make sure that a test does not favor one group of test takers over the others. This study investigated DIF in terms of gender in the reading comprehension subtest (35 i...

متن کامل

Assessing Assessment Literacy: Insights From a High-Stakes Test

This study constitutes an attempt to see what Language assessment literacy (LAL) isfor three groups of stakeholders, namely LAL test developers, LAL instructors, andLAL test-takers. The perceptions of the former group were derived from the contentanalysis of the latest version of the LAL test, and those of the latter 2 groups wereassessed through a survey designed by the researcher. Participant...

متن کامل

Interpreting the Validity of a High-Stakes Test in Light of the Argument-Based Framework: Implications for Test Improvement

The validity of large-scale assessments may be compromised, partly due to their content inappropriateness or construct underrepresentation. Few validity studies have focused on such assessments within an argument-based framework. This study analyzed the domain description and evaluation inference of the Ph.D. Entrance Exam of ELT (PEEE) sat by Ph.D. examinees (n = 999) in 2014 in Iran....

متن کامل

differential item functioning (dif) in terms of gender in the reading comprehension subtest of a high-stakes test

validation is an important enterprise especially when a test is a high stakes one. demographic variables like gender and field of study can affect test results and interpretations. differential item functioning (dif) is a way to make sure that a test does not favor one group of test takers over the others. this study investigated dif in terms of gender in the reading comprehension subtest (35 i...

متن کامل

Gender, Spatial Ability, and High-Stakes Testing

Researchers disagree on the relationships between gender, spatial ability and math achievement. Varied results from studies using different measures and populations fuel the debate. The present study adds to the gender-spatial-math literature by examining this relationship in the context of high-stakes math testing. Results indicate no gender effect on spatial ability or math achievement, and a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal of Listening

سال: 2022

ISSN: ['1090-4018', '1932-586X']

DOI: https://doi.org/10.1080/10904018.2021.1993446